Generalization of Reinforcement Learning with CMAC
Abstract
To generalize value functions in Adaptive Search Element (ASE)-reinforcement learning, a CMAC is integrated into the ASE controller. The ASE-reinforcement learning scheme is briefly reviewed to show how the CMAC is integrated into the ASE controller. The Neighbourhood Sequential Training concept is used to build the CMAC look-up table and to produce discrete control outputs. In computer simulation, an ASE controller and a couple of ASE-CMAC neural networks are trained to balance an inverted pendulum on a cart. The number of trials needed to establish the controllers and their learning performance are evaluated, showing that the generalization ability of the CMAC speeds up ASE-reinforcement learning enough to realize the cart-pole control system. Copyright © 2005 IFAC
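The generalization property mentioned in the abstract can be illustrated with a minimal sketch. The Python below is not the paper's ASE-CMAC controller and does not implement Neighbourhood Sequential Training; it is a generic one-dimensional CMAC (overlapping, offset tilings with one weight per tile). The class name `CMAC`, its parameters, and the sample inputs are all assumptions chosen for illustration.

```python
import numpy as np


class CMAC:
    """Illustrative 1-D CMAC: several offset tilings, one weight per tile.

    All names and default values here are hypothetical, not from the paper.
    """

    def __init__(self, n_tilings=8, n_tiles=10, low=0.0, high=1.0, lr=0.5):
        self.n_tilings = n_tilings
        self.n_tiles = n_tiles
        self.low = low
        self.width = (high - low) / n_tiles
        self.lr = lr
        # One extra tile per tiling so the shifted tilings still cover the range.
        self.weights = np.zeros((n_tilings, n_tiles + 1))

    def active_tiles(self, x):
        # Tiling k is shifted by k / n_tilings of one tile width.
        offsets = np.arange(self.n_tilings) * self.width / self.n_tilings
        idx = np.floor((x - self.low + offsets) / self.width).astype(int)
        return np.clip(idx, 0, self.n_tiles)

    def predict(self, x):
        # The output is the sum of the weights of the active tiles.
        rows = np.arange(self.n_tilings)
        return float(self.weights[rows, self.active_tiles(x)].sum())

    def update(self, x, target):
        # Spread the correction equally over the active tiles only.
        error = target - self.predict(x)
        rows = np.arange(self.n_tilings)
        self.weights[rows, self.active_tiles(x)] += self.lr * error / self.n_tilings


cmac = CMAC()
cmac.update(0.53, 1.0)        # a single training sample
print(cmac.predict(0.53))     # trained input
print(cmac.predict(0.56))     # nearby input sharing most tiles
print(cmac.predict(0.93))     # distant input sharing no tiles
```

After one update, the nearby input moves part of the way toward the target because it shares most of its active tiles with the trained input, while the distant input is unaffected. This local sharing is the kind of generalization that lets a learner reuse experience across neighbouring states.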
Similar resources
Two Novel Learning Algorithms for CMAC Neural Network Based on Changeable Learning Rate
The Cerebellar Model Articulation Controller (CMAC) neural network is a computational model of the cerebellum that acts as a lookup table. The advantages of CMAC are fast learning convergence and the capability of mapping nonlinear functions, due to its local generalization of weight updating, single structure and easy processing. In the training phase, the disadvantage of some CMAC models is an unstable phenomenon...
Adaptive Control of a Coupled Drives Apparatus Using Dual Youla-Kucera Parametrization
An adaptive algorithm based on the dual Youla-Kucera parametrization is introduced enabling simple closed-loop identification and adaptation of a class of symmetric MIMO systems. The methodology exploits the algebraic approach to control system design. Necessary conditions for usage of the developed method are discussed and results are presented for the case of coupled drives control. Copyright...
A Q-learning with Selective Generalization Capability and its Application to Layout Planning of Chemical Plants
In environments where the criteria for achieving a certain objective are unknown, reinforcement learning is known to be effective for collecting, storing and utilizing information returned from the environment. Without a supervisor, the method can construct criteria for evaluating actions to achieve the objective. However, since the information received by a learning agent is obtained through an ...
Web pages ranking algorithm based on reinforcement learning and user feedback
The main challenge of a search engine is ranking web documents to provide the best response to a user's query. Despite the huge number of results extracted for a user's query, only a small number of the first results are examined by users; therefore, placing the related results in the first ranks is of great importance. In this paper, a ranking algorithm based on the reinforcement le...
Persistent Motion and Chaos in Attitude Control with Switching Actuators
In systems with switching actuators, persistent motions of different nature may occur, such as limit cycles, quasi-periodic and chaotic motions. In this contribution the nature of persistent motions in an attitude control system with switching actuators subject to switching restrictions is examined as a function of controller parameters. Bifurcation diagrams are used to describe observations. I...
Publication date: 2005